Talend Big Data Basics

Subscription: This content is available for Talend Academy subscription users.

Instructor-led: This content is available as instructor-led training.

 

Talend provides a development environment that enables you to interact with many big data sources and targets without having to understand or write complicated code.

 

Talend Big Data Basics introduces the Talend components, shipped with several Talend products, that interact with big data systems.

 

Duration: 2 days (14 hours)

 

Target audience: Anyone who wants to use Talend Studio to interact with big data systems

 

Prerequisites: Completion of Introduction to Talend Studio, Talend Data Integration Basics, or Talend Data Integration Advanced

 

Badge: Complete this learning plan to earn the Talend Big Data Developer Practitioner badge. To learn more about the criteria for earning this badge, refer to the Talend Academy Badging Program page.

 

Learning objectives: After completing this learning plan, you will be able to:

  • Create cluster metadata

  • Create HDFS and Hive metadata

  • Connect to your cluster to use HDFS, HBase, Hive, Pig, and MapReduce

  • Read data from and write data to HDFS (HDFS, HBase)

  • Read tables from and write tables to HDFS (Hive)

  • Process tables stored in HDFS with Hive

  • Process data stored in HDFS with Pig

  • Process data stored in HDFS with Big Data Batch Jobs

 

Training modules: To complete the learning plan, take the following training modules: